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Attorney Docket No. 1336 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
Applicant: Philip Richard Martin Date: .Dec. 10. 2002 

Serial No.: 09/759,758; Gr^up Art Unit: 1638' 

Filed: January 12. 2001 Examiner: David T. Fox 

For '-INBRED MAIZE LINE PH5HK" 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

RULE 132 DECLARATION 
OF 

DR. STEPHEN SMITH 

Sir: 

I, Stephen Smith. PhD., do hereby declare and say as follows: 

1 . I am skilled in the art of the field of the invention. I have a Ph.D. in Biochemical 
Systematics and Taxonomy of Maize and its Wild Relatives from Birmingham Univeij^ity. 
1 have a M.Sc. in the Conservation and Utilization of Plant Genetic Resources from 
Birmingham University. I have a Bachelor of Science degree in Plant Sciences from 
London University. Sinc^ 1 977 I have been engaged in the development, study and 
application of molecular markers to genetics, measuring genetic diversity and tracking 
pedigrees. I commenced this work at North Carolina State University as a ppst-doctoral 
research fellow. I have continued my engagement in these studies during my ^ 
employment by Pioneer Hi-Bred from 1980 until the present These studies have 
resulted in numerous scientific articles that have appeared in peer reviewed scientific, 
literature. 

2. I have read and uriWerstood the Office Action in the above case dated October^ 
30, 2002. This declaration is in response to the Examiner's rejection under, 35 U.S.C. § 
102(e) as anticipated by or. ir, the alternative, under 35 U S.C. § 103<3) as_obv.ous over 
Cummings et a!. (U.S. Patent No. 5,977,455). .: , 

3. I have conducted an analysis of SSR marker data for inbred PH5HK and ths 
inbred cited as prior art WKBC5. Out of a total of 74 SSR loci examined, which allowed 
a sampling of each chromosome, there are 46 markers that show differences between 
PH5HK and WKBCS. rh,s represents a difference for 62% for the markers tested Of 
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these 46 markers, 23 were greater than 50 cM in distance, or unlinked on the genetic 
map. 

4. Upon crossing PH5HK to any other maize line and selfing successive filial 
generations, one would within the realm of what is statistically possible, obtain a progeny 
inbred maize line that retains genetic contribution from PH5HK. Assuming thatd) the 
cited prior art is used as the maize line to which PH5HK is crossed, (ii) that the only ■ 
difference between PH5HK and WKBC5 are these 46 markers. and,(iii) that all marker^ 
within a 50 cM distance will segregate together, then the odds of obtaining^ PH54H 
progeny inbred that is the same as WKBC5 after one cycle of breeding, is 1 in 2^ or 1 
in 8,388.603. Statistically it is extremely unlikely that a PH5HK progeny, after one cycle 
of breeding, would be the same as WKBC5. 

5. Further, the assumptions made above vastly overstate the likelihood of breeding 
PH5HK from WKBC5. For example, it is common practice in quantitative genetics to 
detemiine the relation of plants by differences in markers. In doing so, one extrapolates 
that a percentage difference in markers is indicative of a difference in the whole genome. 
To assume that the only differences between PH5HK and WKBC5 are for these 46 
markers, when 46 markers constitute 62% of the 74 SSR loci examined, is a gross and 
unrealistic assumption. Further the current maize genetic map only has approximately 
sixty 50cM units, so by applying this limitation the maximum number of independently 
segregating loci one could obtain. Lising the most different maize lines that could ever be 
found, is sixty. These assumptions result in an over estimate of the odds of breeding 
PH5HKfromWKBC5. 

6. Given the difference in molecular markers between PH5HK and WKBC5, it is my 
expert opinion that PH5HK and WKBC5 are very distinct inventions. It is also my,expert 
opinion that, within the realm of what is statistically possible, any progeny of PH5HK 
developed through crossing PH5HK with another plant will be distinct from WKBC5. 
Given the facts and based on my education and scientific experience. I believe that the 
invention as claimed is not obvious nor anticipated by Cummings et al. (U.S. Patent No 
5,977.455). 

7. I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be tnje; and 
further that these statements were made with the knowledge that willful false statements 
and the like are punishable by fine or imprisonment, or both, under Section 1001 of Title 
18 of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 
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Attorney Docket No. 1335 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: Philip Richard Martin - Date; Febru^ 28, 2003 

Serial No.: 09/759,758 Group Art Unit: 1638 -- 

Filed: January 12, 2001 . Examiner: David T. Fox 

For: "INBRED MAIZE LINE PH5HK" 

Assistant Connnnissioner for Patents 
Washington, D.C. 20231 

RULE 132 DECLARATION 
OF 

DR. STEPHEN SMITH 

Sir: 

I, Stephen Smith, PhD., do hereby declare and say as follows: 

1 . I am skilled in the art of the field of the invention. I have a Ph.D. in Biochemical 
Systematics and Taxonomy of Maize and its Wild Relatives from Birrriingham University. I 
have a M.Sc. in the Consen/ation and Utilization of Plant Genetic Resources from 
Birmingham University. I have a Bachelor of Science degree in Plant Sciences from London 
University. Since 1977 I have been engaged in the development, study and application of 
molecular markers to genetics, measuring genetic diversity and tracking pedigrees. I _ 
commenced this work at NoHh Carolina State University as a post-doctoral research fellow. I 
have continued my engagement in these studies during my employment by Pioneer Hi-Bred 
from 1980 until the present. These studies have resulted in numerous scientific articles that 
have appeared in peer reviewed scientific literature. 

2. I have read and understood the Office Action in the above case dated Octobec-SO, 
2002. This declaration is in response to the Examiner's rejection under, 35 U.S..G. § 112. first 
paragraph, as containing subject matter which was not described in the specification in such 
a way as to reasonably convey to one skilled in the relevant art that the inventor(s), at the 
time the application was filed, had possession of the claimed invention, 

3. I have conducted an analysis of Simple Sequence Repeat, SSR, marker data for base 
inbred PH5HK and three backcross conversions of PH5HK. Two backcross conversions 
were made with an herbicide resistance trait and one backcross conversion was made with a 
waxy starch trait. 

Appendix B 

Received from < 515 334 6883 > at 2128103 1 ;W:35 PM [Eastern Standard Time] 



32/28/03 FRI 13:54 FAX 515 334 6S83 PIONEER HI-BRED DSM Il039 

09/759,758 



4. The SSR data for 457 base inbreds and 1 03 backcross conversion inbreds, including 
PH5HK and the three backcross conversions were used in the analysis. The number of SSR 
markers for each inbred used in the analysis viras between 15 and 87 (mean of 82 ). The 
analysis was done as specified in the publication by Berry et al. ("Assessing Probability of 
Ancestry Using Simple Sequence Repeat Profiles: Applications to Maize Hybrids and 
Inbreds" Genetics 161:813-824. 2002), with modification as descibed in Berry et al;, ( 2003 ); 
Assessing Probability of Ancestry Using SSR Profiles: Application to maize inbred lines and 
soybean varieties Genetics (in review), a copy of which is attached hereto. 

5. The results of the analysis indicated that through the use of SSR markers PH5HK was 
identified to be the recurrent parent of each of the three backcross conversions of PH5HK 
over all the other inbreds in the data set. The probability associated with the Identification of 
PH5HK as the recurrent parent of each backcross conversion was calculated as 0.99 and 
higher in each case. 

6. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on infonnation and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like are 
punishable by fine or imprisonment, or both, under Section 1001 ofTitle 18 of the United 
States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. •' . - 
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Stephen Smith 
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ABSTRACT 

Determining parentage is a fundamental problem in biology and in applications such as 
identifying pedigrees. Difficulties inferring parentage denve from extensive inbreeding within 
the population, whether natural or planned; using an insufficient number of hypervariable loci; 
and from allele mis-matches caused by mutation or by laboratory errors that generate false - 
exclusions. Many studies of parentage have been limited to comparisons of small numbers of 
specific parent-progeny tnplets. There have been few large-scale surveys of candidates in which 
there is no prior knowledge of parentage. We present an algorithm that determines the . 
probability of parentage in circumstances where there is no prior knowledge of pedigree and 
which is robust in the face of missing data and mis-t>ped data. The focus is parentage of an h 
inbred line having uncertain ancestry. The algorithm is a variation of a previously published 
hybrid- focused algorithm. We describe the algorithm and demonstrate its performance in 
detenmining parentage of 43 inbred varieties of soybean that have been profiled using 236 SSR 
loci and from seven inbred varieties of maize that were profiled using 70 SSR loci. We include 
simulations of additional levels of missing and mis-typed data to show the algoritlmi's utility and 
flexibility. 
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The determination of parentage using molecular marker data has been little addressed for 
situations where there is httle or no pnor knowledge of parentage, or when large-scale surveys 
mvolving numerous candidate parents are required. Consequently, we have recently developed 
an algorithm and demonstrated its use m determimng probability of parentage for hybnds in 
circumstances where there is no prior knowledge of pedigree and which is robust in thc^ace of 
missing or mi5-t>T?ed data (Berry et al 2002). We now present a vanation of this aVgoritlim that 
allows determination of parentage for inbred lines or homozygous varieties.. 

We describe and evaluate a methodology that quantifies the probability of parentage of 
homozygous genotypes. Our algorithm takes into account that generations of self-pollination 
occur after the initial parental cross. The number of generations and the mitial parental genotypes 
are unknown. Each generation of inbreeding reduces the number of heterozygous loci m the 
progeny by an average of 50%. Thus, each of the inbred progeny individuals resulting from the 
initial parental cross will have lost approximately half of the parental alleles for loci where the 
inbred parents were fixed for alternate alleles and which were heterozygous in the Fl generation. 

Tl7e loss of parental alleles during the inbreeding phase is in contrast to the case of a hybrid. 
progeny. An inbred progeny individual will exhibit a lower level of allelic similarity to either of 
its inbred parents than a hybrid progeny will to its inbred parents. This loss of some parental 
alleles during inbreeding might be expected to make an inbred algorithm less robust in the face 
of missing or mis-typed data compared with the hybrid algorithm that has been previously 
described (Berry et al 2002). We therefore demonstrate the effectiveness and robustness of the 
inbred algorithm using examples from two species of cultivated plants. We first tested the 
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algorkhm using varieties of the naturally self-pollinating, inbred crop, soybean [Glycine max. 
(L) Merr.]. This crop was selected because numerous varieties of soybean with known pedigrees 
were available to us, many of which are closely related. We also used publicly bred inbreds of 
maize {Zea mays L.) that are of knoN\^ pedigree. Maize is naturally an outcrossing-species but 
inbred lines'are most usually generated for use as parents of commercial hybrids. Inbred liis£S are 
generated by making successive generations of self-pollination following the initial bi-parentaU 
cross. 

MATERIALS AND METHODS 

Algorithm: The algorithm is a variation of the hybrid version of Berry ei ai (2002). Consider an 
mdex inbred whose parentage is unknown or in dispute. A database containing possible inbred 
ancestors is available. The objective is to find the probabilities of closest ancestry for each inbred 
in the database using genotypic infoimation from a large number of SSRs. ' 

Consider a pair of possible ancestors, inbred / and inbred j. We calculate the probability that 
inbreds i and j are in the indexes ancestry, repeating this for'^all pairs of inbreds in the database. 
Let P{iJ\SSRs) stand for the posterior probability that i and j are ancestors of the index given the 
' information from the various SSRs. Let PQJ) stand for the uncondrtionaK(or prior) probability'of 
the same event and let P{SSRs\ij) be the probability of observing the various SSR results if in _ 
fact / and J are ancestors of the index. Just as in Berry et al. (2002), Bayes' rule relates these 
various probabilities: 
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Piij\SSRs) = P{SSRs\ijrP(iJ) I l[P{SSRs\u,vyPMl 

where the sum in the denominator is over all pairs of inbrcds in the database, indexed by u and v. 
We need to calculate P{SSRs\ij) for each / and J. We will make the '^no-prior-information" 
assumption that P{ij) is the same for all pairs (ij). Then P{u,v) is'a common multipleon the 
denominator that cancels with P{ij) in the numerator: ^ ' - ^ 

P{ij\SSRs) - P(SSRs\ij) I ZP{SSRs]u,v). 

The problem is to calculate a v/P'i^a\ P{SSRs\iJ), the probability of observing the index's SSRs 
assuming inbreds / and J are both ancestors. The nature of breeding before the self-pollination 
process is unknown. Since the creation of an inbred proceeds by multiple generations of self- 
pollination on a hybrid, we label the (unknown) hybrid used to create the (known) index inbred 
as the intermediate hybrid. WTien the intermediate hybrid is an immediate descendent of / andy, 
it receives one of Inbred f s alleles and one of inbred fs alleles. When the intermediate hybnd is 
a second generation descendent of z and y, it receives one allele from each with probability 0.5. 
And so on. Since degree of ancestry (if any) is unknown, we label the actual probability of 
passing on one of these alleles to the intermediate hybrid to be As in Berry et n/, .(2002) we 
consider p = 0.50 and p = 0.99 and he>e we also consider the intermediate value p = 0.75. - 

When inbreds / and j are ancestors then tliere are four possibilities: (l>the alleles of both i andy 
were passed to the intermediate hybrid, (2) i came through but not j\ (3) j came through but not 
and (4) neither came througli. Assuming independence, these have respective probabilities p' \ 
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p{ \-p), p{ \~p), {l-pf^ An allele in the intermediate hybrid's genotype that did not arise from 
either inbred / or inbred J is assumed to be selected with probability \/n, where n is the total 
number of alleles at the SSR in question. So far the steps we have described are identical to those 
for identifying the ancestors of a hybrid described by Berry et aL (2002) and, in fact, if the index 
is heterozygous at an SSR then calculations proceed just as for hybrids. Calculations are 
substantially different when the index inbred is homozygous, say genotype aa. Cases that must 
be considered are shown in Table 1, where x is any allele different from a (but not missing). All 
alleles other than a can be grouped because only a appears m the index's genotype. Forexample^ 
XT might be be or hd or bh, 

P(SSR\ij) is the probability of obsen-'ing the index assuming inbreds / and j are ancestors. The 
calculations for SSRs 1 to 6 are shown in Table 2, where the four terms in each case are in order 
gf (1), (2), (3), (4) defined in the previous paragraph. Missing alleles are not considered in the 
examples above. The number of possibilities is large. Here we consider only tlie case in which 
inbred / is aa and both alleles of inbred j are missmg. Then 

P(SSR\iJ) = p'(l/2^I/2'^!/n) + p(]-p)(]/2+l/n^l/2) - p(}-p)(I/n) + (I-p/(J/n) 

.Another possibility not considered above is that more than two alleles can be observed for an SSR 
marker run on individual DNA sample. This can be due to SSR locus duplication, homology due to 
alloploidy, more than one individual plant being sampled for DNA extraction or cross-contamination. In 
this case we consider all possible pairings of the observed alleles and calculate usmg a multiple 
imputation procedure (Little and Rubin, 1987). 
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To find the overall PiSSRs\i.j), multiply tlie individual P{SSR\iJ) over the various SSRs. To 
determine the probability that any particular inbred, say inbred z, is the closest ancestor of the 
index,^um P{SSR\i,v) over all inbreds v.with v 9^. Call this P{i\SSRs). The maximum of 
P{i\SSRs) for any inbred / is I . But since there is one closest ancestor omeach side of the faraHy, 
the sum of P{i\SSRs) over all inbreds i is 2. ^ 

SSR data: Soybean DNA was extracted from 490 varieties, all of which w.ere bred in, and are 
adapted to, the United States. Plant material for DNA extraction was sampled from six plants of ^ 
each variety. Most of the varieties are proprietary products of Pioneer Hi-Brcd International 
Several (non-patented) commercial varieties from other breedmg companies and some important 
publicly bred varieties were also included. Procedures for obtaining SSR data from soybean were 
identical to those described for maize by Berry et al. (2002) apart from the following 
modifications: PCR products with different size ranges and labeled with different fluorochiomes 
were pooled and diluted 1 :9 with capillary electrophoresis buffer (Apphed Biosystems) then 1 :4 ^ 
with dH20. l-5ul of pooled DNA were added to lOul formamide containing the molecular weight 
size standard 400HD ROX (Applied Biosystems, ROX = 6-carboxy-X-rhodamine). Fragment 
separation was performed using capillary electrophoresis on an ABI3700 platfonm (Applied 
Biosystems), with an injection time of 10 sec at 10,000 V and a run time of 4,000 sec at 7,500 V 
Forty-three soybean varieties that had both of their parent varieties also included in the dataser 
were assigned as index vaneties.'One to t\\'o and occasionally three.grandparent varieties of 
several of the index varieties were also included in the dataset. These varieties collectively 
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represent a broad array of diversity of soybean germplasm that is currently grovrTi in the United 
States. 

Two hundred and thirty-six publicly available soybean SSR markers = 
(http://soyba5e.agr0n.iastate.eda') were used to demonstrate and evaluate the algorithm. These — 
SSR markers were selected following initial screens on a subset of 24 soybean varieties in which 
they were tested for amplification and the ability to detect polymorphism. The 236 markers gave ^ 
good genome coverage and collectively mapped across each of the chromosomal linkage groups 
of soybean. 

All allele scores were made without knowing tlie idenlihes of the soybean, genotypes. 

Maize SSR data using 70 loci were previously reported by Senior et ai (1998) and were obtained 
directly from the first author. This publication (Senior et aL 1998) cites an array of 94 
historically important publicly bred lines that have well known and well established pedigrees. 
This array of public inbreds includes seven inbreds (A632, A634, Mol7, Pa9U Va35, Va99 and 
W64A) that each have SSR profiles for their parental lines included in the same dataset. Three of . 
these inbreds were developed from a breeding cross of two unrelated parents. These are: Mol7 
which was bred from the cross of C.I.i87-2 x C103; Va99, which was bred from the cross 
- OhOTB X Pa91 ; and W64A which was bred from the cross of WF9 x CT. 187-2. Other iribred 
progeny had more complex ped*igrees. One inbred (Va35) was bred from the cross C103 x T8 
following an additional cross of T8 as the recurrent parent. Two inbreds (A632 and A634) were 
bred from the cross Mt42 x B14 following additional crosses of B 14 as the recurrent parent, 

Q 
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Pa9I was bred from a complex cross involving four inbreds (WF9 x Oh40B) and (38-1 1 x 
L317). These seven progeny inbreds therefore provided an index set of maize Inbreds for 
evaluation of the inbred algorithm. 

RESULTS 

Data quality: The soybean SSR data that were used to evaluate the algorithm had a mean of 
5.5% (range 0-19% loci) missmg data per variety. For parent-progeny triplets, there was a mean 
of 1.1% loci (range 0-5%) where a progeny profile was scored for an allele chat -was not 
represented by cither of the seed sources that represented the parents. The maize SSR data had a . 
mean of 0.7% missing data (only three genot>pes had missing data; these were at elevated levels 
of 5%, 9%, and 36%). A mean of 6.4% parent/progeny triplets (range 4-7%) had SSR progeny 
profiles that did not share an allele with either of the seed sources that were available to represent 
the original parental genotypes. ^ 

Probability of ancestry applied to soybean data: Figures 1 and 2 present the probabiUties of 
closest ancestry of the top ranking varieties for each of 43 soybean varieties using data from. 236 
marker loci atp = 0.50 (Fig 1) and at/? = 0.99 (Fig 2). 

Waen the algorithm was used zip ^ 0.5 with data from all 236 loci (Fig 1), then 24/43 (56%) of 
index varieties had both parents coirectly identified in the top two ranked positions, 12/43 (28%)^ 
had one parent correctly placed in one of the top two positions, and 7/43 (16%) had none of tlie 
actual parents assigned into the top two ranked positions. Thus, when p - 0.5 was used> 60/86 
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(70%) of actual parental varieties were correctly ranked in the top cw'o positions and 26/86 (30%) 
were incorrectly placed in lower positions. 

When the algorithm was used at/? = 0.75 with data from all 236 loci (data not shown), 28/43 
(65%) of index varieties had both parents correctlytidentified in the top nvo ranked positions, 
1 1/43 (26%) had one parent correctly placed in one of the top two positions, and 4/43 (9%) had 
^none of the actual parents assigned into the top h^'o ranked positions. Therefore, whenp = 0.75 
was used 67/86 (78%) of the actual parental varieties were correctly ranked in the top two 
positions and 19/86 (22%) were incorrectly placedlin lower positions. 

When the algorithm was used at/? - 0.99 with data from ail 236 loci (Fig 2), then 33/43 (77%) of 
actual parental varieties were correctly ranked in the top two positions and 10/86 (23%) had one 
parent correctly placed; all index varieties had at least one parent ranked in the top two positions 
when the algorithm was used at/? = 0.99- With p used at 0.99 then 76/S6 (88%) of actual parental 
-varieties were correctly assigned; 10/86 (12%) were incorrectly assigned, . 

Table 3 presents the rankings, probabilities, and pedigrees of varieties that were incorrectly 
assigned above a true parent. The largest pedigree class (41% of cases where a non-parent ranked ^ 
above a true parent) of non-parents ranking higher than parents was for varieties that are - 
^" denvatives of the parent that was misplaced at a lower ranking. The equal second largest classes 

(each representing 14% of the cases) were for varieties that were (a) full sibs of the true but 

I 

misplaced parent and (b) full sibs of a grandparent' of the vanety for which the pedigree was 
bemg tested. Other categories (percent of cases in parentheses) were: multiple backcross versions 
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of the misplaced parent (7%), a derivative of the vanety or which the pedigree was being tested 
(7%), a half-sib of the true but lower ranked parent (7%), a full sib of the variety for which the 
pedigree was being tested (3%), and a half-sib of the variety for which the pedigree was being 
tested (3%). Insufficiently detailed pedigree infprmation is available tp categon^e one variety 
(3% of cases) diat ranked above the true parent - - 

t* 

Robustness: The quality of soybean SSR data as received from the laboratory, in tenn$ of 
missing data and apparently nOn-Mendelian parent-progeny triplets, have already been presented. 
Taking these data as an initial starting point, additional levels of.missing and mis-typed data 
were created by simulations and used to explore robustness of the algorithm, 

SSR data for five index soybean varieties were used to determine the robustness, of the algorithm. 
Subsets of data were created that included parameters of reduced.numbers of loci, additional, 
levels of missing data, additional levels of mis-typed data, and various combinations of these 
parameters. Simulated levels of missing and mis-typed data were created with a first pass 
creating missmg daia» followed by a second pass creating mis-typed data. Therefore, for 
example, the maximum level of cumulative error from simulated missing and mis-typed data was 
from,36 to 40%. Five varieties were chosen to represent a range of diversity in respect of both 
. pedigree and SSR profiles. Four varieties had no parents or grandparents in common and one 
pair of varieties was related by a common parent. All varieties had parents ranked in the top two 
positions when the algorithm was run at;? =^ 0.75 and p = 0.99. This selection of varieties 
therefore provides a means to establish lower boundaries for-both the quantity and quality of 
SSR data that are required to avoid aberrant results. 
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Table 4 presents the probability of ancestry of the top five ranked varieties for each of five 
selected soybean index varieties (93B1 U A79S6, P9443. S38T3 and Young) when the algonthm 
is run using different numbers of SSR marker loci (50, 100, 150 and 236) afeach of two levels of 
p (0.5 and 0.99). Using p - 0,5, the lowest percentage of parents (60%) -diat were correctly 
ranked into the top two positions corresponded to using only 50 SSR. Increasmg the number.pf 
loci to 100 or 150 or 236 increased the ability to identify the actual parents to about 90%. When 
p was used at a leve! of 0.99 all parents were correctly ranked into the top two positions for each 
of the five varieties when data from as few as 50 SSR loci were used. 

Table 5 summarizes other aspects of robustness. Namely, we simulated additional levels of 
missing, mis-typed and missing plus mis-typed data, beyond those that were inherent m the data 
as provided by the laboratory. When p was used at a level of 0.5, robustness was generally 
maintained up to an additional level of 20% simulated missing data, so long as data from 100 or 
more loci were used. Similarly, robustness was maintained for up to 20% additional mis-typed 
data so long as data from 100 or more loci were used. Likewise, robustness was maintained with 
up to 1 8 to 20% additional levels of data error including both missing and mis-t>ped data, so 
long as data from 150 or more loci were used. Using data for all 236 loci provided a higher level 
of robustness, but even then robustness collapsed when 36 to 40% cumulative additional error - 
from missing and mistyped data were simulated into the analysis. The overall level of correct 
assignation of parent varieties was higher when p was used at a level of,0.99- All parents then 
were correctly identified, i^cn when data from only 50 loci were used up to an additional level 
of 10% missing data. When data from lOO or more loci were used then all parents were correctly 
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identified with up to 20% additional missing data. Robustness started to decline when the 
algonthm was applied with 10% additional mis-typed data when data from 150 or fewer SSR 
loci were used. However, robustness was maintained for up to 20% additional mis-typed data 
when data from 236 SSR loci were used. When additional levels of both incorrect data were 
applied then robustness was maintamed at levels of up to 10% missing plus 10% mis-typed data^ 
so long as data from at least 150 SSR loci were used. Robustness was compromised w^hen 
additional simulations of 20% missing-plus 20% mis-typed data were apphed even .when data 
from all 236 SSR loci were used. • i , 

We then investigated the relationships of varieties to the index genotvpe whose pedigree was 
under examination by rerunning the analysis after both parents of the index genotype had been 
removed from the analysis. Fifteen varieties that had two or more of their grandparents profiled ■ 
in the dataset were used for this examination. After removing parents, direct pedigreed : 
derivatives of the index genotype ranked first for P9583, in the first three places for A2943 and 
in the first six places for P9561 . Once all parents and derivatives of the index genotype had beea 
removed from the analysis then the following results were obtained. Predominant classes of 
varieties ranking in the top five positions were (percent of cases in parentheses); derivatives of 
llie ^aadpaient of the index variety (32%), grandparents of the index variety (16%), derivatives 
of the parents of the index variety (16%), and half-sibs of the index variety (13%). Grandparents 
ranked among the first four positions far 10 vaneties and were in the firsX place for five vancties. 
Great-grandparents ranked within the first seven places for three vaneties, and a great-great- 
grandparent ranked in eighth place for one variety. Other varieties that ranked in thcfirst place 
were usually closely related to the variety whose pedigree was under examination; full-sibs and 
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half-sibs were the predominant classes of relatives other than grandparents in the first ranking 
position after parents and direct derivatives of the variety under exanriination had been removed- 
Probability of ancestr>' applied to corn data: The seven index inbreds of maize were selected 
* because they represented'all of the inbred lines published npon by Senior er a/.K1998) that had 
all of their inbred parents also included in the SSR dataset. All of the inbred hne*.publishcd by 
Senior et aL (1998) have well known arfd well established pedigrees that are^fully provided by 
those authors. 

Table 6 presents probabilities of ancestry for the top five ranked inbreds for each of the seven 
index inbred lines at two levels of p (0.5 and 0.99). For the three progeny that were bred from 
single crosses without any subsequent use of one of the parents to make a recurrent cross prior to. - 
inbreeding (Mol7, Va99, and W64A) then use of the algorithm at either/? = 0-5 or atp = 0.99 
resulted in the parental inbreds being ranked in first and second positions. Use of the algorithm at 
p ^ Q 99 provided greater discrimination for probabilities of ancestry that were assigned to actual 
parents compared to highest ranking non-parents. This was most noticeable for the case of inbred 
Va99 which had a relatively low value when used ^ip^ 0.5 for parent 2 (0.5221) compared to 
parent I (0.9999) or to the third ranked inbred (and non-parent), Va22 (0.4252). hi contrast, 
when.the program was run at /? = 0.99 then-parent 1 and parent2 for Va99 bad probabilities of 1 
and 0,9855, respectively, with the probability of the third ranked inbred being 0.0131. 

For each of the three progeny inbreds that onginated from breeding schemes that involved one or 
more additional crosses of one of their parents, using the algorithm atp = 0.5 resuhed in 
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placement of the respective recurrent parent with the highest probability of ancestry. Raising the 
level ofp to 0.99 resulted in both parents (B 1 4 - recurrent parent and MT42 the non-reciurent 
parent) of the index inbred A632 being ranked in the top two places. Using this level ofp also 
caused a higher ranking (third position) For the non-iBcurrent parent (MT42) of index inbred 
A634. Use ofp at 0.99 did not cause the non-reeunent parent (C103) of index inbred (Va35) to 
rank into the top five places. ' - ^* 

For the index inbred (Pa91) that was bred from a more complex .cross involving four inbred 
lines, the use ofp at 0.5 or at 0.99 resulted in the two parents (WF9 and Oh40B) being ranked in 
second and third places; highest ranked was inbred Va99 (Va99 is derived from the index inbred 
Pa91). Neither of the two remaining parents of Pa91 ranked in the top five places. 

DISCUSSION 

The current widely used North American soybean vaneties are founded upon a relatively narrow 
genetic base of diversity. Gi^lice et aL (1994) document that the U, S- soybean germplasm base 
is foimded upon 20 plant introductions and that subsequent breeding has made repeated use of 
related parents. Molecular marker compansons of elite U. S. soybean varieties compared to a 
sample of exotic varieties reinforce the conclusion that there is a relative paucity of genetic 
variation in U. S. soybeans. Narvel et aL (2000) have shown that the number of alleles detected 
among the exotics was 30% greater than among U. S. varieties. Thompson and Nelson (1998) 
report that very little exotic germplasm has been incorporated into the existing U. S. soybean 
germpiasm base. Examining all pairs of pedigree relationships among the 490 soybean varieties 



16 



Received from < 515 3^ 6883 > at 2/28/03 1:44:35 PM [Eastern Standard Time] 




02 '28/03 FRI 13:59 FAX 515 334 6883 PIONEER HI-BRED DSM ^0$6 

employed in this study showed that approximately 50% of painvise relationships are related at 
the level of half-sib or closer; approximately 10% of pairs are related at the level of full-sib or 
closer. This set of soybean varieties therefore provides the basis for an extremely ngorous 
evaluation of the ability of SSR data to distinguish between varieties and of this algorithm to 
identify pedigree?. Pedigree breeding, including the use of related parents, is also commonly 
applied in the breeding of maize inbred lines. The set of maize mbreds used here thus also 
provides a meaningful evaluation of the marker data to discriminate among inbred Imes and qf 
the joint ability of the algorithm and of the marker data to allow a determination of inbred 
pedigrees. 

Use of the algorithm at p = 0.99 rather than at a lower level miproved performance in terms of 
the percentage of correct assignations of parents and provided a greater statistical differential for ' 
probabilities for parents in comparison to the highest ranking non-parents. Use of the algorithm 
31 0.99 IS more appropriate when it is known that the actual parents of the variety under 
examination are included among the set of index varieties. If it is not known that the parents are 
included m the mdex set then use of the algorithm at;? = 0.5 is more jusnfied (Berry el al. 2002). 
For the soybean varieties, when p was used at 0.99, then 77% of all varieties that were queried 
for their parents had both parents correctly identified. Eight-eight percent of soybeap parents 
were correctly identified across 43 index varieties that were queried for their parents. All 
varieties (with the possible exception of one variety where detailed pedigree information was not 
available) that ranked above true parents were related either to the mis-ranked parent or to the 
variety th:it was bemg queried for its pedigree. Our previous report of the use of an* algorithm to 
determine hybrid pedigrees (Berry et al. 2002) showed a higher level ofcTorrec: parental 
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detenninaiions at p = 0-99. Many of these 50>'bcan vanetics have a high degree of pedigree 
relatedncss. However, many of the maize inbred lines that were used in the previously reported 
study (Berry ei al. 2002) were also highly related. It is, however, likely to be inherently more 
challenging to correctly identify parents following cycles of inbreeding because half of thp 
alleles that are segregating in^*e first generation followhig the.initial breed ing^cross will be 
subsequently lost as recurring cycles of self-fertilization occur. Thus, many of the alleles that arq 
present m a hybrid, and which can therefore contribute to the identification of its pedigree, do not 
remain present in an inbred homozygous progeny. 

We examined the pedigrees of soybean index varieties when both parents ofthe index had been 
removed from the set of candidate varieties. Direct pedigree dcscendents wi-th the mdex variety, 
as one parent then usually ranked higher than other varieties, including varieties that were 
grandparents or sister varieties ofthe index variety. When all parents and direct derivatives ofthe 
index variety were excluded from the analysis then the predominant classes of varieties ranking 
in the top five positions were derivatives ofthe grandparent ofthe index variety (32%), 
grandparents ofthe mdex variety (16%), derivatives of the parents of the index variety (16%), ' 
and half-sibs oFthe index variety (13%). The SSR data that were available to us .did not allow a 
thorough or very precise assessment of how varieties with different degrees of relatedness would 
rank as members of the pedigree in the event that the true parents were not present in the 
database. Nonetheless, when parents were excluded from the analysis then ,Y^elies that were 
very closely related to the index variety ranked highest. Direct descendents dependent for their 
pedigree upon the index variety, if present, tended to rise above varieties included widiin other 
classes of pedigree relationship to the index variety. WTien varieties directly descended by 
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pedigree from the index variety were also excluded then a grandparent ranked into first position 
for 33% of the varieties that were examined. Direct pedigree derivatives of one or more of the 
parents of the index variety had an equal level of occurrence when parents and denvativcs of the 
index variety were excluded. Further investigations of the identification of grandparents will 
require a dataset including all grandparents of each index vanety and will also require a revised 
algoritlim to take account of pedigree contributions from four varieties as opposed to pairs of 
varieties which forms the basis of the current inbred algorithm. • ■ 

For the maize inbred line pedigrees, use of the algorithm either at/? = 0.5 or atp = 0.99 resulted 
in the correct identification of both parents in all cases where the breeding scheme was an initial 
cross of Uvo parental lines followed by subsequent cycles of inbreeding (i.c.for the inbreds 
Mol7, Va99 and W64A). The relatively higli level of robustness for results with raaize inbreds at 
p = 0.5, in contrast to the results obtained from analyzing soybean data (where 56% of varieties 
had both parents correctly identified when p = 0.5 was used) could be accounted for by the 
smaller sample size of maize inbreds and by the" lower degree of mean pedigree relatedness 
amongst this selection of inbred lines in comparison to the soybean vanetjes. Thus while several 
inbred lines in this set are closely related, there remain many inbreds that have little or no 
pedigree relationship (Senior er a/. 1998). 

The inbred algonthm correctly identified both parents of the three maize index inbreds that h^d 
been bred from bi-parental crosses that involved equal contributions (by pedigree) from both 
parents. For the three bi-parental crosses that involved subsequent additional crosses of the 
recurrent parent (and thus significantly biased contnbutions by pedigree to the ir.dex vanety 
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fixim the recurrent parent) then use of the algorithm correctly identified each of the recurrent 
parents. The algorithni was unable to identify the non-recuirent parent in most cases, but this 
result would be expected because one backcross reduces the expected pedigree contribution of 
the non-recurrent inbred to 25%. More generations of backcrossing using the recurrent parent 
then further reduce the expected pedigree TOntribution of the non-recurrent parent by half at each — 
' generation (successively to 12.5%, 6.25%, 3.125%) with the pedigree contribution of the - 
recurrent parent rising accordingly. Since several inbred lines of maize are related by pedigree 
then it is not surprising that the level of pedigree or SSR similarity of a non-recurrent parent to 
the index progeny can fall below other inbred lines that are related to the index variety. The - 
algorithm was not able to preferentially, identify parents of the inbred line Pa9l, which was bred 
from a complex breedmg scheme mvolvmg four parents with equal contributions by pedigree. A 
more suitable algonthm is needed to take account of four way crosses. However, such a need is 
primarily academic because most breeding crosses in commercial maize breeding, and indeed for 
most crops, are bi-parental. 

These soybean data had a mean of 5,5% missing data per variety and a mean of 1,1% loci where a^- 

a progeny was scored with an allele that was not also scored in either or both parents. Such 

apparent non-Mendelian or exclusionary profiles can be due to pollen contannnation.dunng - ' 

inbreeding, cross contamination in the field or laboratory, scoring errors in the. laboratory (e.g. 

scoring +A, predominant stuttering^ spectral pull ap> secondary binding sites or polymer spikes), 

or incorrect pedigrees. Another source of apparent exclusion is through jthe use of a seed soiu-ce - ^ 

as a parent that is still heterogeneous due to inbreeding being incomplete. Cycles of inbreeding 

then continue so that when those seed sources are used in the future as sources for SSR profiling 



20 



Received from < 515 334 6883 > at mm 1 :44:35 PM [Eastern Standard Time] 




02'2S/03 FRI 14:00 FAX 515 334 6S83 PIONEER HI-BRED DSM ^080 

to represent the parental genotype they will have lost alleles due to inbreeding that have already 
been passed on to a progeny Aitemaiely, residual heterozygosity within seed sources can result 
in low frequencies of heterozygotes or off-type scgregants which may, by chance, be sampled in 
the progeny, but not sampled in the parent. In this study we sampled six plants to represent the 
variety which may be insufficient to capture alleles existing at low frequencies withm the seed 
source. And even if the allele was sampled, it may not have been detected following PGR 
amplification due to predominance of the most frequent allele and allelic competitionreffects. 
Hall (2002) has also reported the occurrence of apparent non-parental SSR alleles. Mutation can 
also affect SSR profiles. Vigouroux et ai (2002) have estimated mutation rates of 7.7 x 10"^ per 
generation for dinucleotide SSRs and an upper 95% confidence limit of 5.1.x 10"' for SSRs with 
longer repeat units. A level of error or discrepancy in expected SSR profiles are thus inevitable- 
for some, if not all crop plants. We therefore evaluated the robustness of the. algorithm and ■ 
dataset by rcnmning the algorithm using datasets that were simulated to have up to 20% 
additional levels of missing plus 20% mis-typed data beyond the level that was received from the 
laboratory. The algorithm maintained its initial level of robustness with up to an additional level 
of 10% both missing and mis-typed data, provided data from at least 100 SSR loci were used. 
Fewer loci (60) were capable of retaining this degree of robustness in the evaluation of the 
hybrid pedigree algorithm using maize hybrids (Berry et al. 2002). The loss of parental alleles 
that occurs dunng the inbreeding process, in contrast to their retention in ahybrid progeny - - 
compared to its parents, probably underlies the neef to use data from a greater number of loci to 
maintain robustness for the inbred algorithm as compared to the hybrid algorithm. 
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It was anticipated that determination of pedigrees following cycles of inbreeding might be more 
challenging to accomplish than to determine pedigrees of hybrids where the total nuclear genetic 
contributions of both parents are preserv ed. Nonetheless, these results show that the algoriilim 
can be used effectively to identifying parents of inbred genotypes. Nearly 90% of soybean 
parents were identified. This is a set of genotypes which, due to the relatively narrow^o under ., 
base and subsequent cycles of development through the use of related crosses, provides an 
extremely rigorous test of the algorithm and of the discriminatory power of the marker data. 
Supplementary data also show the capability of the algorithm to identify parents of maize inbreds 
that have been developed in a pedigree system using nvo parents. Use of this algorithm with • 
currently available codommantly expressed molecular marker data has also been shown to.have 
practical feasibility because of the high degree of robustness that is evident and which extends 
well beyond the realm of aberrant or unexpected marker data that is encountered. These types of 
error or unexpected marker data can include laboratory error, sampling effects or the use of.,, 
different seed sources for the actual parental source compared to a more inbred source that 
becomes available later to represent the parental genotype. This algorithm has application in a 
number of fields, including conservation biology, population genetics, and to assistin the 
protection of intellectual property rights. -vv 
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Table 1. Calculations of ancestry for homozygous index inbreds: Cases that must be considered 
for example of genotype aa. 



SSR 


Index 


Inbred i 


Inbred j 


1 


aa 


aa 


Aa 


2 


aa 


aa 


Ax 


3 


aa 


aa 


Xx 


4 


aa 


ax 


Ax 


5 


aa 


ax 


Xx 


6 


aa 


XX 


Xx 



X is any allele different from a, but not missing 
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Table 2 Probability of obsemng the index [P(SSR\IJ)] assuming inbreds i andy are ancestors: 
Calculations for SSRs 1 to 6. 







1 


p-(4/4) + p(l-p)(l/2+l/n*l/2) + p(l-p)(l/2+l/n''l/2) ^ (l-py(J/n) \ 


2 


p-'(3/4) ^ pfl-p)(l/2+l/n*l/2) + p(l-p)(l/2*l/2+lJn*I72) + (1- 
pf(]/n) 


3 


p-(2/4) - p(l-p)(l/2+l/n*l/2) + p(l-p)fl/n*l/2) + (J-p/(l/nJ 


4 


p-'(2/4) + p(l-p)(!/2*l/2+l/n*l/2) + p(]'p)(l/2*l/2 + l/n*l/2) +.(1- 
p)-(l'n) 


" 5 p'a/4) + pd-pXl/l'in+l/n*!/!) - p(l-p)(l/n*l/2) -r (l-prfl/n) 


6 p'fOM) ^ p(l-p)(l/n*l/2) + p(l^p)(lM*}/2) + (J-p/{]/nj 



The four terms in each case are in order of the four possibilities when inbreds r and j are 
ancestors: (1) the alleles of both / and j were passed to the intermediate hybrid, (2) / came 
through but not (3)y came through but not i, and (4) neither came through. Missing alleles are 
not considered- 
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Table 3, Probabilities of ancestry and pedigree relationships for soybean varieties where both 
parents did not rank above non-parents. 



Case no. Index variety Rank Possible ancestor Probability 



1 


95B97 


1 


Parent 2 


1 

1 






2 


Full sib of parent 1 


0.5822 






3 


Parent 1 


0,4124 


2 


A2945 


1 


Parent 1 


0 0Q77 






J 


Multiple DacKcross oi parcni z 








3 


rarent L 


n 1 QQQ 


3 


A 4:) 93 


1 
1 


Parent 1 


1 






2 


Derivative of parent 2 








>•> 
3 


Multiple baclccross oi parent z 








4 


Derivative oi rdreni z 


0 0006 






5 


Halfsibof A4595 


0.0004 






6 


Parent 2 


0.0001 


4 


T T t 

Hark 


1 


Parent 1 


1 
1 






2 


Derivative of parent 2 


1 

y 






3 


Derivative of parent 2 


2.1E-09 






4 


Derivative of parent 2 


1.4E-09 






5 


Derivative of Hark 








6 


Derivative of parent 2 


1 1 p 1 T 






7 


unknown 


3.8E-15 






8 


Derivative of parent 2 


4.6E-17 






9 


Derivative of parent 2 


/ IZ-Z 1 






10 


Parent 2 


1 7P 1 1 

Z. I lL-Z I 


5 


Kent 


1 


Parent 2 


1 






L 


Derivative of parent 1 








-1 
J 


l^CllVOllVCUl puiUiiL 1 


0.0011 






4 


Parent 1 


3.0E-04 


6 


P9583 


1 


Parent 1 


1 






2 


FullsibofP9583 


0.8801 






3 


Parent 2 


0.1199 


7 


P964I 


1 


Parent 2 


1 






2 


Derivative of P9641 


1 






3 


Parent 1 


3.7E-06 


8 


S^0J2 


1 


Parent 1 


1 






2 


Derivative of parent 2 


0.9321 
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3 Parent 2 0.0679 

9 YB30K0I 1 Parent 2 1 

2 Half sib of parent 1 I 

3 Full sib of parent 2 7.9E-09 

4 Half sib of parent 2 3-3E-09 

5 Full sib of grandparent L2E-10 

6 Deriv-ative of parent 1 3.0E-11 

7 Full sib of parent 2 2.0E-11 

8 Full sib of grandparent 8.7E-12 

9 Parent! ' l.lE-12 

10 YB41Q01 1 Parent 2 1 

2 Full sib of parent 1 1 

3 Full sib of grandparent 7.3E-05 

4 Full sib of grandparent 4.1E-09 

5 Parent 1 9.1E-10 



Results for 33 (77%) varieties where both parents were ranked first and second are not included 
in this table (see Figures 1 and 2). 
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Tabic 4. Probability of ancestry for five individua! 
from different numbers of loci (50, 100, 150.236). 



soybean varieties using SSR data obtained 





T sn 




LlOO 




L150 




L236 




Inbred 


Possible ancestor 


Prob 


Possible ancestor 


Prob 


Possible ancestor 


Prob 


Possible ancestor 


Prob 


P=0.5 
9381 1 


XB3J C 






1 


XB3fC 


1 


XB3IC 


1 




A34 15 


U.QUVJO 




0.9362 


A3415 


0.9146 


A34IS 


0-9954 




XB^oAOI 


U.UZ_)0 




0.0429 


WILLIAMS 


0.0809 


WILLIAMS 


0.0046. 




F92/ 1 




ziD A *t X 


0.0155 


YB30L0I 


0.0034 


A3242 


0 




YBSuLOi 




I D JUJ-iU 1 


0.0015 


A3242 


0.0006 


DOUGUS . 


0 


A79S6 


COOK 


\J. f t 40 


ftp A YTn\^ 


0.9725 


BRAXTON 


I 


BRAXTON 


1 




wci jc "J r\r\r\ 


Vj.-i?S4 L 




0 5302 


YOUNG 


0.8910 


YOUNG 


0.9929 




S6262 


yj, lo-to 




0 3S72 


P964 1 


0.0404 


XB63D00 


0.0071 




YOUNG 


U. 1 (J J 






XB63 DOO 


0.0254 


96B32 


0 




pHAa I Uf^ 


C\ 1 




0 0328 


COOK 


0.0245 


P964I 


0 


P9443 


DOUG LA J 








FA YETTE 


0.576O 


FAYETTE 


0.9S85 




A34/5 


v. /o/y 


r A 1 iL lit. 




A3415 


0.7034 


DOUOL4S 


0.8847 




U fT T T r 4 i J'C 

yVfLLfAMjn 


U.USiS / 


r\r\j jfzi A c 




CX399 


0.1671 


A3415 


0.0846 




jALL 






0.2032 


CX260C 


0J273 


WILLIAMS 


0.0348 






n 041 1 


win FA MS 

rrl Iji^i. rt Art 


0.1608 


WILLIAMS 


0.0948 


CX399 


0.0062 


COS TV 


J J J J 


0.871 1 


S3 5 35 


0 9993 


S3535 


1 


S3 5 35 


1 




S4644 


0.4543 


S4644 


0.99SS 


S4 644 


1 
I 




\ 




YB44R01 


0.2762 


yB40M01 


0.0012 


YB17Y00 


0 


A4263 


0 




YB40M0I 


0,1087 


YB44R0I 


0 0004 


93865 


0 


YB44R0I 


0 




YB44Q0I 


0.0325 


¥837 YOG 


0.000 1 


Aa26S 


0 


YB37Y00 


0 


YOUNG 


DA VIS 


0-6589 


DAVIS 


0 6551 


DA VIS 


0.6324 


DA VIS 


0.9752 




XB63D00 


0.4942 


ESSEX 


0 59'9 




0.552-1 


P96n 


0.5397 




96B32 


0.3122 


P9641 


0.3409 


COOK 


0.323 1 


ESSEX 


0.3273 



29 



Received from < 515 334 6883 > at 2/28i'03 1 :44:35 PM [Eastern Standard Time] 



02/28/03 FRl 14:02 FAX 515 534 6aS3 



# 

PIONEER HI -BRED DSM 



2)069 





COOK 


0,0101 COOK 


0.1692 


ESSEX 


0.2817 


96B32 


0,12 




OGDEN 


0.0606 96B32 


0.1315 


96B32 


0.1933 


COOK 


0.02 


p=0.99 
















9381! 


X3SIC 


\ XB31C 


I 


XB3IC 


1 


XB3JC 


1 




A34I5 


0.9999 A34I5 


0.9999 


A34I5 


1 


A34J5 


1 




A3242 


0.0001 A3242 


0.0001 


P9443 


0 


WILLIAMS 


0 




P9443 


0 P9443 


0 


A 3 242 


0 


A3242 


0 




WILLIAMS 


0 WILLIAMS 


0 


WILLIAMS 


0 


FA YETTE 


0 


A7986 


BRAXTON 


1 BRAXTON 


1 


BRAXTON 


1 


BR.4XTON 


1 




YOUNG 


0 9903 YOUNG 


0.9903 


YOUNG 


0.9987 


YOUNG 


1 




P964J 


0.0092 P9641 


0.0092 


96B32 


0.0012 


XB63D00 


0 




96B52 


0 0005 96B32 


0,0005 


P964I 


0.0002 


96B32 


0 




OA VIS 


0 DA VIS 


0 


DA m 


0 


P964I 


0 


P9443 


DOUGLAS 


0.9998 DOUGLAS 


0-9999 


FAYETTE 


0.9995 


DOUGLAS 


1 




FA YETTE 


0.7010 FAYETTE 


0.7011 


DOUGLAS 


0.9993 


FAYETTE 


1 




CX260C 


0.2345 CY2(5(?C 


0.2345 


CX399 


0.0006 


CX260C 


0 




A3^15 


0,0644 A3415 


0.0643 


A3415 


0.0005 


CX399 


0 




S3 041 


0.0001 AP3330 


0.0001 


P9394 


0.0001 


43415 


0 


S3ST8 


S3335 


1 


1 


S3535 


1 


S3535 


1 




S4644 


1 5^tf^^ 


1 


S4644 


I 


S4644 


1 




YB40MOI 


0 YB40M0I 


0 


93B67 


0 


A4268 


0 




YB44R01 


0 


0 


ST37SO 


0 


YB54J0O 


0 




93B67 


0 YB44R0I 


0 


YB37Y00 


0 


YB44R0I 


0 


' YOUNG 


DA VIS 


1 


1 


DA VIS 


I 


DAVIS 


1 




ESSEX 


1 £55£vV 


1 


FSSKK 


1 


ESSEX 


1 




F964I 


0 P964/ 


0 


COOK 


0 


S4240 


0 



30 

Received fronK 515 334 6883 > at 2/28/03 1 :44:35 PM [Eastern Standard Timel 



02.'28/03 FRI 14:11 FAX 515 334 6883 PIONEER HI -BRED DSM il092 



Attorney Docket No. 1335 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant: Philip Richard Martin 

Serial No.: 09/759,758 

Filed: January 12. 2001 

For: - "INBRED MAIZE LINE PH5HK" 

Assistant Commissioner for Patents 
Washington, D.C. 20231 
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RULE 132 DECLARATION 
OF 

DR. DINAKAR BHATTRAMAKKI 



Sir 



I, Dinakar Bhattramakki, Ph.D., do hereby declare and say as follows: 



1. I am skilled in the art of the field of the invention. 1 have a Ph.D. in. Plant Molecular 
Genetics from the University of Illinois at Urbana^Champaign. I have a Bachelor of Science 
degree in Agricultural Sciences from the University of Agricultural Sciences, Bangalore. India. 
Since 1 997 I have been engaged in the analysis of molecular markers for plants. , I have - 
supen/ised the Molecular Marker Applications lab at Pioneer Hi-Bred International. Inc. from 
January 2002 until the present. 

2. I am familiar with the methods used in the analysis of Simple Sequence Repeat. SSR, 
marker data for inbred PH5HK conducted at Pioneer Hi-Bred International. Inc. The analysis 
of the SSR profile of inbred PH5HK may be accomplished without any undue- 

experimentation. The SSR profile for inbred PH5HK is attached hereto. . 

3. Means of performing this genetic marker profile are well known in the art SSRs are 
genetic markers based on polymorphisms in nucleotide sequences. The PCR^" detection of 
SSRs is accomplished by using two oligonucleotide primers flanking the polymorphic 
segment of DNA. Amplification is accomplished through repeated cycles of heat denaturation 
of the DNA followed by annealing of the primers to their complementary sequences at low 
temperatures, and extension of the annealed primers with DNA polymerase. 

4. Markers are scored following amplification and gel electrophoresis of the ampimcation 
products. Scoring of marker genotype is based on the size or weight of the amplified ^ 
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fragment. While variation in the primer used or in laboratory procedures can affect the 
reported marker score, relative values remain constant regardless of the specific primer or 
laboratory used. 

5. Primers that may be used to Tdentify the SSR markers reported herein are publicly 
available and may be found in the Maize DB on the World Wide Web at 
agron.missouri.edu/maps html (sponsored by the University of Missoun"), in^haropova'et al. 
(Plant Mol. Biol. 48(5^):463-481 ) and/or in Lee et al (Plant Mol. Biol. 48(5-6); 453^61). 
Markers shown for PH5HK are the^publidy available markers in the sources listed above for 
which PH5HK was tested and shown to be homozygous. 

6. Map information is provided by bin number as reported in the Maize DB. The bin 
number digits to the left of decimal point typically represent the chromosome on which such 
marker is located, and the digits to the right of the decimal typically represent the location on 
such chromosome. 

7. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true;, and further that 
these statements were made with the knowledge that willful false statements and the like are 
punishable by fine or imprisonment, or both, under Section 1 001 of Title 1,8 of the United 
States Code and that such willful false statements may jeopardize the validity of the. 
application or any patent issued thereon. 

Date: Febmaty 28. 2003 By:_ 




•v... 



Dinakar Bhattramakki 
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1.01 249.32 

1.02 114.36 

1.02 212.96 

1.03 232.00 
1.03 142.91 
1 03 145.56 
1.03 306.51 

1.03 143.49 

1.04 217.79 

1.05 215,70 
1.05 139.18 



Public Name of PH5HK 
Marker Bin* base pairs 

phi4279l3 101 129.38 

bnlgl014 1.01 124-16. 

phi056 

bnlg1127 ' 
bnlg1429 
bnlg439 
phil 09275 
phi339017 
bnlg1203 
bnlg1484 
bnlg2086 
bnlg1832 
bnlg1886 
bnig1057 
bnlg1041 
bnlg1615 
bnlg1556 
phi323065 
phi335539 
phi423298 
phi002 
bnlg1331 
phiOII 
phi308707 
phi227562 
phi265454 
phi064 
pbi96100 
bnlg1017 
bntg1064 
bnlglOIS 
bnlg1138 
bnlg1396 
bnlg1831 
phi251315 
phi328l89 
phi427434 
phi435417 
bnlg1141 
bnlg1940 
phi127 
bnlg1520 
phi101049 
phi453121 
phi104127 



1.06 


249.42 


1.06 


240.52 


1.06 


232.40 


1.07 


194.17 


1.08 


329.53 


1.08 


88.58 


1.08 


127.73 


1 08 


69.78 


1.09 


119.11 


1.09 


212.44 


1.10 


131.11 


1.11 


306.93 


1.11 


217.66 


1.11 


94.65 


2.01 


284.43 


2.02 


175.37 


2.03 


188.18 


2.04 


134.33 


2.06 


218.34 


2.05 


133.20 


2.06 


189 90 


2,07 


123-79 


2.08 


121.35 


2.08 


124.37 


2.08 


214.27 


2.08 


152.92 


2.08 


247.33 


2.08 


122 04 


2.09 


285.85 


2.10 


234.30 


3.00 


212.06 


3.01 


162.19 
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phi4O4206 3.01 303.14 

phi193225 3.02 133-45 

phi374118 3.02 225.95 

bnlg1144 3.02 115.08 

bnlg1647 3.02 128.99 , 

bnlg1523 3.03 318.23 

bnl9l019 3.04 158.83 

bnlg1113 3.04 93.98 

bnlg1452 3.04 107.48 

bnlg1035 3.05 101.06 

phi053 3 05 191.64 

phil 02228 3.06 135.04 

bnlglieo 3.06 222.23 

bnlgl951 306 125.99 

bnlg2241 3.06 142.63 

phi072 4.00 160.51 

phi213984 4.01 284.60 

phi295450 4.01 184.83 

bnl9l162 4.03 104.11 

phi30809O 4.04 218.64 

phi096 4.04 234.58 

phi438301 4.05 211.81 

bnlg1159 4.05 147.81 

bnlg1755 4.05 214 54 

bnlg1937 4.05 237.92 

bnlg1265 4.05 202.06 

phi079 4.05 185.40 

bnlg1189 4.07 127.93 

bnlg2244 4.08 206.26 

bnlglOOe 6.00 216.82 

phi396l60 5.02 298.12 

phi109188 5.03 167.28 

bnlg653 5.04 153.96 ' 

phi330507 5.04 131.59 

phi331888 504 130.72 

bnlg1208 5.04 1 34.77 

bnlg1892 504 154.18 

phi333597 5.05 213.82 

phi085 5.06 256 26 

bnlg1118 5.07 64.10 

bnlg1711 507 178.63 

phi423796 6.01 135.33 

phi389203 6-03 306.15 

phi452693 6.04 123.30 

phi445613 6.05 96.95 

bnlg1174 605 218.72 

phi299852 6 07 119.55 

phi364545 6.07 125.26 
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bnlg1759 
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6.07 


oI.UD 
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7 02 
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7.03 
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7 04 
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7.05 
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7.05 
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7.06 
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8.00 
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bnlg1194 


8 02 


140.77 
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8.03 
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8.03 
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8.03 


97.90 
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8.04 


320.35 


bnlg1176 


8.05 
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8.06 
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bnl9l065 


8.07 


228.48 
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8.08 


94.48 


phiOIS 


8.08 
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8.09 


136.10 


bnlg2122 


9.01 


219,19 


bnlgl012 


9.04 


163.61 
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9.04 


236.63 


phi10S411 


9.05 


122.38 


phi236554 


9,05 


122.93 


bnlg1129 


9.08 


294.80 


phi041 


10.00 


202.94 


phi96342 


10.02 


252.35 


phi059 


10.02 


153.35 


bnlg1D79 


10,03 


170.34 


phiO50 


10.03 




phi30l654 


10.04 


128.33 


phi062 


10.04 


15107 


phi323152 


10.05 


134.72 


bnlg1074 


10-05 


177.82 


bnlg1185 


10.07 


151.61 


bnlg1450 


10.07 


189.35 


phi109642 


2.03/2,04 


137.61 


bnlg1720 


1.09/1.10 


226.57 


phi448880 


9.06/9.07 


186.11 
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